OPEN 3 ACCESS Freely available onli 



ine 



•0-PLOS I ONE 



Susceptibility to Chronic Mucus Hypersecretion, a 
Genome Wide Association Study 



CrossMark 



(D 



Akkelies E. Dijkstra 1 ' 2 , Joanna Smolonska 2 ' 3 , Maarten van den Berge 1,2 , Ciska Wijmenga 3 , Pieter Zanen 4 , 
Marjan A. Luinge 5 , Mathieu Platteel 3 , Jan-Willem Lammers 4 , Magnus Dahlback 6 , KerrieTosh 7 , Pieter S. Hiemstra 8 , 
Peter J. Sterk 9 , Avi Spira 10 , Jorgen Vestbo 11 ' 12 , Borge G. Nordestgaard 13 ' 14 , Marianne Benn 14 , Sune F. Nielsen 13 , 
Morten Dahl 15 , W. Monique Verschuren 16 , H. Susan J. Picavet 16 , Henriette A. Smit 17 , Michael Owsijewitsch 18 ' 19 , 
Hans U. Kauczor 18 ' 19 , Harry J. de Koning 20 , Eva Nizankowska-Mogilnicka 21 , Filip Mejza 21 , Pawel Nastalek 21 , 
Cleo C. van Diemen 3 , Michael H. Cho 22 ' 23 ' 24 , Edwin K. Silverman 22 ' 23 ' 24 , James D. Crapo 25 , Terri H. Beaty 26 , 
David A. Lomas 27 , Per Bakke 28 , Amund Gulsvik 28 , Yohan Bosse 29 , M. A. Obeidat 30 , Daan W. Loth 31 ' 32 , 
Lies Lahousse 31,33 , Fernando Rivadeneira 34,35 , Andre G. Uitterlinden 34,35 , Andre Hofman 31 ' 35 , 
Bruno H. Strieker 31,32 , Guy G. Brusselle 31,33 , Cornelia M. van Duijn 36 , Uilke Brouwer 2,5 , Gerard H. Koppelman 2,37 , 
Judith M. Vonk 2,38 , Martijn C. Nawijn 2,5 , Harry J. M. Groen 1 , Wim Timens 5 , H. Marike Boezen 2,38 , 
Dirkje S. Postma 1 ' 2 *, the LifeLines Cohort study' 



1 University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, the Netherlands, 2 University of Groningen, University Medical Center 
Groningen, GRIAC research institute, Groningen, the Netherlands, 3 University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands, 
4 University of Utrecht, University Medical Center Utrecht, Department of Pulmonology, Utrecht, the Netherlands, 5 University of Groningen, University Medical Center Groningen, 
Department of Pathology and Medical Biology, Groningen, the Netherlands, 6AstraZeneca, R&D Molndal, Molndal, Sweden, 7 AstraZeneca, Alderley Park, Macclesfield, Cheshire, United 
Kingdom, 8 Department of Pulmonology, Leiden University Medical Center, Leiden, the Netherlands, 9 Department of Respiratory Medicine, University of Amsterdam, Amsterdam, the 
Netherlands, 10 Department of Medicine, Section of Computational Biomedicine, Boston University Medical Center, Boston, Massachusetts, United States of America, 1 1 Odense University 
Hospital, Department of Respiratory Medicine J, and University of Southern Denmark, Clinical Institute, Odense, Denmark, 12 Respiratory and Allergy Research Group, Manchester 
Academic Health Sciences Centre, University Hospital South Manchester NHS Foundation Trust, Manchester, United Kingdom, 13The Copenhagen City Heart Study, Frederiksberg 
Hospital, Copenhagen University Hospital, University of Copenhagen, Copenhagen, Denmark, 1 4 Department of Clinical Biochemistry, Herlev Hospital, Copenhagen University Hospital, 
University of Copenhagen, Copenhagen, Denmark, 1 5 Department of Clinical Biochemistry, Rigshospitalet, Copenhagen University Hospital, University of Copenhagen, Copenhagen, 
Denmark, 1 6 Center for Prevention and Health Services Research, National Institute for Public Health and the Environment, Bilthoven, the Netherlands, 1 7 Julius Center for Health Sciences 
and Primary Care, UMC Utrecht, Utrecht, the Netherlands, 18 Department of Diagnostic and Interventional Radiology, University Hospital of Heidelberg, Heidelberg, Germany, 
1 9 Translational Lung Research Center (TLRC-H), Member of the German Center for Lung Research (DZL), Heidelberg, Germany, 20 Department of Public Health, Erasmus Medical Center 
Rotterdam, Rotterdam, the Netherlands, 21 Department of Respiratory Diseases, Jagiellonian University Medical College, Krakow, Poland, 22 Channing Division of Network Medicine, 
Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States of America, 23 Division of Pulmonary and Critical Care Medicine, Department of Medicine, 
Brigham and Women's Hospital, Boston, Massachusetts, United States of America, 24 Harvard Medical School, Boston, Massachusetts, United States of America, 25 Division of Pulmonary 
and Critical Care Medicine, National Jewish Health, Denver, Colorado, United States of America, 26 Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, 
Baltimore, Maryland, United States of America, 27 Cambridge Institute for Medical Research, University of Cambridge, Cambridge, United Kingdom, 28 Department of Thoracic Medicine, 
Haukeland University, Hospital and Institute of Medicine, University of Bergen, Bergen, Norway, 29 Institut universitaire de cardiologie et de pneumologie de Quebec, Department of 
Molecular Medicine, Laval University, Quebec City, Canada, 30 Division of Respirology, Department of Medicine, James Hogg Research Centre, St Paul's Hospital, University of British 
Columbia, Vancouver, BC, Canada, 31 Dept Epidemiology, Erasmus MC, Rotterdam, the Netherlands, 32The Netherlands Healthcare Inspectorate, The Hague, the Netherlands, 
33 Department of respiratory medicine, University Hospital Ghent, Ghent, Belgium, 34 Department of Internal medicine, Erasmus MC, Rotterdam, the Netherlands, 35 Netherlands 
Consortium for Healthy Aging (NCHA), Rotterdam, the Netherlands, 36 Genetic Epidemiology Unit, Department of Epidemiology, Erasmus Medical Center, University Medical Center, 
Rotterdam, the Netherlands, 37 University of Groningen, University Medical Center Groningen, Beatrix Childrens' Hospital, Department of Pediatric Pulmonology and Pediatric Allergology, 
Groningen, the Netherlands, 38 University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands 



Background: Chronic mucus hypersecretion (CMH) is associated with an increased frequency of respiratory infections, excess 
lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, 
but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing 
genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. 

Methods: (MI h analysis was performed in the NELSON-study using the lllumina 610 array, followed by replication and meta- 
analysis in 1 1 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or 
former heavy smokers (>20 pack-years). Additional studies were performed to test the functional relevance of the most 
significant single nucleotide polymorphism (SNP). 

Results: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25x10~ 6 , 
OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein I locus {SATBI) on chromosome 3. The risk allele 
(G) was associated with higher mRNA expression of SATBI (4.3x10~ ) in lung tissue. Presence of CMH was associated with 
increased SATBI mRNA expression in bronchial biopsies from COPD patients. SATBI expression was induced during 
differentiation of primary human bronchial epithelial cells in culture. 

Conclusions: Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a c/s-eQTL for SATBI, 
together with our additional observation that SATBI expression increases during epithelial differentiation provide 
suggestive evidence that SATBI is a gene that affects CMH. 
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Introduction 

The secretion of mucus is a natural part of the airway defense 
against inhaled noxious particles and substances. Chronic mucus 
hypersecretion (CMH) is a condition of overproduction of mucus 
and defined as the presence of sputum production during at least 
three months in two consecutive years without any explaining 
origin whereas airway obstruction is not a prerequisite [1]. 
Smoking is a risk factor for CMH, i.e. the prevalence of CMH in 
the general population is reported to be 7.4% in current smokers, 
3.7% in ex-smokers and 2.4% in never smokers [2]. CMH is the 
key presenting symptom in chronic bronchitis, one of the three 
main sub-groups of chronic obstructive pulmonary disease 
(COPD), a complex disease characterized by the presence of 
incompletely reversible and generally progressive airflow limitation 
[3]. Moreover, CMH is a risk factor for the development of 
COPD [4,5]. 

Worldwide, COPD affected 65 million people in 2004 and 
more than 3 million people died of COPD in 2005, representing 
5% of all deaths. It is predicted that COPD will be the third 



leading cause of death worldwide in 2030 [6]. COPD markedly 
reduces quality of life and is responsible for high healthcare costs. 
For instance, the combined (direct and indirect) yearly costs of 
COPD and asthma in the United States of America were 
projected at $68 billion in 2008 [7]. CMH is not only associated 
with COPD but also with an increased duration and frequency of 
respiratory infections, excess decline in forced expiratory volume 
in 1 second (FEVi) and increased hospitalization and mortality 
rates in the general population [4,5,8,9] . 

It is not known why only a minority of all smokers develops 
CMH, yet a plausible explanation is the presence of a genetic 
predisposition for CMH, as evidenced by familial aggregation of 
mucus overproduction and higher prevalence of CMH in 
monozygotic than in dizygotic twins [10—12]. Littie is known 
about the identity of the genes that predispose to CMH. One 
publication suggested that CTLA4 is associated with chronic 
bronchitis in COPD [13]. 

The aim of our study was to identify genetic factors for CMH, 
thereby obtaining a better insight into the origins of this disorder. 
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Table 1. Questions used to define chronic mucus hypersecretion in the corresponding cohorts. 





Cohort 


Question 


NELSON [24] 


Do you expectorate sputum on the majority of days more than 3 months a year, even when you do not have a cold? 


Rotterdam [25,26] 


Do you expectorate sputum on the majority of days during >3 months during the last 2 years? 


LifeLines [27] 


Do you usually expectorate sputum during day or night in winter? If yes: Do you expectorate sputum on the majority of days 
>3 months a year? 


Vlagtwedde- Vlaardingen [28,29] 


Do you expectorate sputum on the majority of days >3 months a year? 


Doetinchem [30] 


Do you expectorate sputum during winter, day and night, each day for 3 months? 


Poland [31,32] 


Do you usually bring up phlegm from your chest, or do you usually have phlegm in your chest that is difficult to bring up when 
you don't have a cold? If yes: Are there months in which you have this phlegm on most days? If yes: Do you bring up this 
phlegm on most days for as much as three months each year? A positive answer to all (3) questions identifies CMH. 


Heidelberg [33] 


Do you expectorate sputum on the majority of days >3 months a year? 


GLUCOLD [15] 


Do you expectorate sputum immediately after getting up on the majority of days in winter >3 months a year? 


Rucphen [29] 


Do you expectorate sputum during day or night in winter? If yes: Do you have expectoration on the majority of days >3 
months a year? 


ECLIPSE [34] 


Do you usually bring up phlegm from your chest on getting up, first thing in the morning, during the rest of the day or at night, 
on most days for 3 consecutive months or more during the year? 


COPDGene [35] 


Do you usually bring up phlegm from your chest on getting up, first thing in the morning, during the rest of the day or at night, 
on most days for 3 consecutive months or more during the year? 


Norway [36,37] 


Do you usually bring up phlegm from your chest on getting up, first thing in the morning, during the rest of the day or at night, 
on most days for 3 consecutive months or more during the year? 



doi:1 0.1 371/journal.pone.0091 621 .t001 

Materials and Methods 

Ethics Statement 

The Dutch ministry of health and the Medical Ethics 
Committee of the hospital approved the study protocol for all 
Dutch centers. Ethics approval and written informed consent was 
obtained from all participants in all studies participating. For 
detailed information, see Supplement SI. 

Subjects and genotyping 

We performed GWA studies in participants of the NELSON- 
study (n = 3,729), a male population-based lung cancer screening 
study investigating heavy smokers (&20 pack-years) [24]. 



Table 2. Overview of populations. 



Replication of SNPs with p<10 -4 was attempted in six cohorts 
participating in 'COPD Pathology: Addressing Critical gaps, Early 
Treatment & diagnosis and Innovative Concepts' (COPACETIC) 
and in five non-COPACETIC cohorts. Caucasian subjects with 
^20 pack-years smoking with genotype-, spirometric- and 
demographic data were included. 

An overview of the CMH definitions used in this study is 
presented in Table 1 . A brief description of the included cohorts 
and details according to the period of data collection, type of 
population, genotyping platforms and genetic imputation software 
are presented in in Table 2. 



Study Data Collection Type of population Genotyping platform Imputation software 



NELSON 


2004/2005 


general population 


lllumina Quad 610 


NA 


GLUCOLD 


2005/2006 


COPD case 


lllumina Veracode 


NA 


Vlagtwedde Vlaardingen 


1989/1990 


general population 


lllumina Veracode 


NA 


Doetinchem 


1998/2002 


general population 


lllumina Veracode 


NA 


Poland 


2005/2006 


general population 


lllumina Veracode 


NA 


Heidelberg 


2004/2005 


general population 


lllumina Veracode 


NA 


Rucphen 


2002 


Family based COPD on a doctor 
diagnosis 


lllumina Veracode 


NA 


Rotterdam 


2002/2008 


general population 


lllumina 550K 


MaCH 


JfeLines 


2008/2010 


general population 


lllumina Human CytoSNP-12 


BEAGLE V3.1.0 


COPDGene 


2008/2009 


COPD case/control (stage 1— IV) 


lllumina Human Omni1-Quad 


MaCH 


ECLIPSE 


2005/2007 


COPD case/control (stage II— IV) 


lllumina Human HAP 550 V3 


MaCH 


Norway 


2003/2005 


COPD case/control (stage II — IV) 


lllumina Human HAP 550 VI, V3, 


MaCH 



and DUO 



Populations and corresponding period of data collection, type of population, genotyping platform and soft-ware used for imputation. 

NA = not applicable. 

doi:1 0.1 371/journal.pone.0091 621 .t002 



PLOS ONE | www.plosone.org 



3 



April 2014 | Volume 9 | Issue 4 | e91621 



Genetic Influence of SATB1 on Airway Disease 



GWAS: NELSON + healthy controls 
Logistic regression, 
adjustment for center 



GWAS: NELSON 
Logistic regression, 
adjustment for center and smoking 



SNPsp< 10' 

I 



SNPsp< 10" 



Genotyping of shared top-SNPs in 
replication cohorts, 
individuals > 20 packyears 



NELSON + replication cohorts 
Logistic regression, adjustment for 
sex and current smoking 



META-ANALYSIS 



Figure 1. Study design. We performed GWA studies in the NELSON cohort and in additional healthy controls. CMH was analyzed using logistic 
regression with adjustment for center (Groningen and Utrecht). Since current smoking can affect the presence of CMH, we additionally performed the 
GWAS in the NELSON cohort correcting for center and smoking. SNPs with a p-value<10-4 present in both GWA studies were selected for replication. 
To test for generalizability of associations with CMH in other populations, we compared our results with data in CMH-cases and controls with a 
smoking history of >20 pack-years with eleven replication populations using logistic regression with adjustment for sex and current smoking. Finally, 
we performed a meta-analysis on shared SNPs across the NELSON identification population and the 1 1 replication populations. 
doi:1 0.1 371 /journal.pone.0091 621 .g001 



Strategy 

We searched for SNPs associated with CMH by using a two- 
stage strategy followed by a replication stage and meta-analysis 
(Figure 1). 

Statistical analysis 

General characteristics of CMH-cases and controls were 
compared using Student's t- and Mann-Whitney-U tests for 
continuous variables and %2 tests for dichotomous variables with 
SPSS 20.0. Sample and SNP quality control (QC), regression- and 
meta-analysis were performed with PLINK 1 .07 [25] . QC criteria 
are described in Supplement SI. 

Logistic regression analysis under an additive model was used to 
identify SNPs associated with CMH. SNPs with a p-value<10~ 4 
were included for replication. When two SNPs were in strong 
linkage disequilibrium (r2&0.8), tne SNP with the lowest p-value 
was further analyzed. 

SNPs in COPACETIC cohorts and in LifeLines were analyzed 
using logistic regression with adjustment for sex and smoking (ex-/ 
current smoking). In LifeLines, imputed SNPs with an info-score 
<0.3 (imputation quality score) were removed. SNPs in non- 
COPACETIC cohorts were analyzed by the cohort investigators 
using the same model. 

Meta-analysis was performed on SNPs across NELSON and the 
1 1 replication cohorts. The Cochran's Q test was used to test for 
heterogeneity in the meta-analysis. 

We performed multivariate logistic regression analysis, adjusted 
for pack-years and lung function, to associate CMH with the risk 
allele of rs6577641 in the identification cohort. 



Functional relevance of SATB1 and rs6577641, our 
highest ranked-SNP 

We performed 4 functional studies with the identified top-SNP. 
Details on their methods are given in Supplement SI. 
We assessed: 

1) whether rs6577641 is an eQTL, by analyzing the association 
of SATB1 expression levels with rs6577641 genotypes in lung 
tissue from three independent cohorts recruited from Laval 
University, University of British Columbia, and University of 
Groningen as described previously [14]; 

2) CMH-associated mRNA expression in airway wall biopsies 
from 77 COPD participants in the GLUCOLD-study [15]; 

3) the association of homozygous genotypes for rs6577641 with 
a) immunohistochemical staining (IHC) for SATB 1 and b) the 
fraction of mucus positivity on bronchial tissue explanted from 
COPD or lung cancer subjects that underwent lung surgery; 

4) SATB 1 expression levels during mucociliary differentiation of 
primary bronchial epithelial cells cultured at air-liquid 
interface [26]. 

Results 

Populations 

Characteristics of the identification and replication populations 
are presented in Table 3. Subjects with CMH were more often 
current smokers and had worse lung function, except for 
populations including subjects with COPD only. 

Identification analysis 

After QC, 492,700 SNPs and 2,512 individuals (717 CMH 
cases, 1,795 controls) from the NELSON study remained. Logistic 
regression analysis was performed including these individuals 
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Table 3. Demographic and clinical characteristics of CMH-cases and -controls with >20 pack-years, present in the meta-analysis. 





Population 


CMH 


N 


Population % 


Female, % 


Age, yrs 
(SD) 


Pack-yea rs 
(range) 


Current 
smoking, % 


FEV1 %, 
pred. (SD) 


FEV1/FVC, % (SD) 


NELSON 


Control 


1,795 


71.5 


0 


60.2 (5.3) 


34 (21-156) 


47.5 


100.3 (17.2) 


72.9 (8.7) 


NELSON 


Case 


717 


28.5 


0 


60.4 (5.6) 


39 (21-140) 


74.2 


93.5 (20.0) 


69.2 (11.0) 


Rotterdam 


Control 


1,043 


84.1 


46.1 


68.0 (9.3) 


45 (20-149) 


40.1 


92.4 (23.5)* 


72.8 (8.7)* 


Rotterdam 


Case 


197 


15.9 


43.7 


72.0 (8.4) 


40 (20-168) 


45.2 


85.0 (26.9) # 


68.0 (11.1)* 


LifeLines 


Control 


1,431 


88.1 


80.1 


52.9 (9.2) 


27 (20-100) 


56.4 


98.2 (15.6) 


72.4 (8.2) 


LifeLines 


Case 


193 


11.5 


46.9 


53.2 (9.9) 


29 (20-97) 


75.4 


90.5 (18.0) 


68.3 (11.3) 


Vlagtwedde- 
Vlaardingen* 


Control 


234 


82.4 


27.4 


52.9 (10.1) 


29 (20-128) 


51.7 


94.5 (12.1) 


76.6 (4.5) 


Vlagtwedde- 
Vlaardingen* 


Case 


50 


17.6 


18 


53.4 (10.5) 


33 (22-83) 


68 


86.7 (18.6) 


71.0 (8.9) 


Doetinchem 


Control 


250 


80.6 


37.2 


54.7 (8.8) 


30 (20-90) 


55.6 


94.8 (17.6) 


71.5 (9.9) 


Doetinchem 


Case 


60 


19.4 


36.7 


56.4 (7.7) 


33 (20-72) 


68.3 


89.1 (19.6) 


69.3 (11.4) 


Poland 


Control 


97 


85.1 


22.7 


56.7 (10.5) 


30 (20-116) 


52.6 


96.4 (21 .4) 


72.5 (0.5) 


Poland 


Case 


17 


14.9 


11.8 


55.8 (9.4) 


35 (22-86) 


82.4 


93.5 (24.0) 


69.2 (13.1) 


Heidelberg 


Control 


608 


84.2 


35.7 


58.1 (5.2) 


37 (23-138) 


54.3 


96.4 (17.6) 


78.9 (9.7) 


Heidelberg 


Case 


114 


15.8 


29.8 


58.0 (5.2) 


37 (23-91) 


91.2 


86.2 (21 .5) 


75.3 (10.6) 


GLUCOLD** 


Control 


48 


55.2 


8.3 


62.6 (7.6) 


46 (21-182) 


62.5 


63.4 (9.8) 


50.4 (9.1) 


GLUCOLD** 


Case 


39 


44.8 


20.5 


59.6 (7.4) 


40 (22-83) 


61.5 


63.9 (8.8) 


53.1 (7.8) 


Rucphen** 


Control 


28 


53.8 


46.4 


66.5 (7.9) 


42 (21-120) 


57.1 


74.5 (15.7) 


57.2 (7.8) 


Rucphen** 


Case 


24 


46.2 


41.7 


62.2 (10.5) 


43 (21-100) 


70.8 


70.2 (21 .6) 


53.1 (9.7) 


ECLIPSE** 


Control 


961 


62 


37.5 


64.1 (6.7) 


53 (21-205) 


28.1 


48.0 (15.7) 


44.5 (11.3) 


ECLIPSE** 


Case 


590 


38 


24.1 


62.9 (7.4) 


54 (22-220) 


47.5 


46.2 (15.5) 


44.3 (11.7) 


COPDGene 


Control 


628 


71.8 


53.5 


63.1 (8.6) 


50 (21-173) 


28.2 


75.0 (28.3) 


63.7 (17.6) 


COPDGene 


Case 


247 


28.2 


40.5 


61 .9 (8.4) 


54 (21-237) 


50.2 


60.4 (27.4) 


54.6 (17.9) 


Norway 


Control 


501 


52.4 


44.9 


61.5 (10.3) 


34 (20-130) 


46.9 


71.7 (24.2) 


64.6 (15.7) 


Norway 


Case 


456 


47.6 


20.4 


64.1 (10.1) 


39 (20-119) 


59 


56.5 (24.4) 


55.0 (17.3) 



CMH = Chronic mucus hypersecretion; 
*lung function is based on FEVyiVC; 
**all individuals in this cohort have COPD; 

*based on lung function of 700 subjects who returned for follow-up study 4 years later. 
doi:1 0.1 371/journal.pone.0091 621 .t003 



supplemented with 590 additional healthy controls, adjusting for 
center. The QQ;plot provided no evidence of population 
stratification (X= 1.0185). 77 SNPs were associated with CMH 
with a p-value<10" + . CMH was associated with current smoking 
in our identification cohort (p<0.001). Therefore, we performed a 
second GWA adjusting for center and current/ex-smoking (717 
CMH-cases, 1,795 controls). The QQ_-plot showed no evidence of 
population stratification (X = 1 .0056). We observed 64 SNPs with a 
p-value<10 4 . Genome wide association for CMH ordered by 
chromosome is shown in the Manhattan plot. Figure 2 shows QQ_- 
plots (A, C) and genome wide association signals for CMH ordered 
by chromosome (Manhattan-plots, B and D) of these sequential 
analyses. We identified 36 SNPs associated with CMH with a p- 
value<10 in both analyses Table 4. Of these, 32 SNPs were 
included for replication and 4 SNPs were removed because they 
were in strong linkage disequilibrium (r 2 >0.8) with another 
associated SNP. 

Replication of associated SNPs 

Genotyping of SNP rs4775569 failed in the COPACETIC 
populations, and was removed for further analysis. CMH- 
associated top-SNPs for each cohort are presented in Table 5, 



with a complete overview in Table 6. When applying Bonferroni 
correction in the meta-analysis (p = 1.61 xlO 3 for 31 SNPs), we 
found a strong association with one SNP: 

• rs6577641, a SNP located on chromosome 3 in intron 9 of the 

special AT-rich .sequence- binding protein 1 locus (SATB1) (combined 
p-value = 4.25xl0" 6 , OR= 1.17; 1.10-1.26). 

The SATB1 SNP rs6577641 had the lowest p-value for 
association with CMH in the meta-analysis. Figure 3 shows the 
forest plot of rs6577641 in the identification and replication 
cohorts and meta-analysis. 

We assessed the percentage of subjects with CMH in each 
genotyping group for rs6577641 in NELSON-total and stratified 
for current and ex smokers (Figure 4). Multivariate logistic 
regression analysis, corrected for pack-years and FEVi %predicted, 
showed that CMH was significantly associated with the number of 
G-alleles in the 1,385 current smokers (reference = AA: heterozy- 
gous mutant (AG) p = 0.001; OR=1.50, homozygous mutant 
(GG) p = 0.001; OR =1.80) but not in 1,127 ex-smokers (refer- 
ence =AA: heterozygous mutant (AG) p = 0.380; OR =1.18, 
homozygous mutant (GG) p = 0.143; OR= 1.42). 
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Figure 2. Quantile-quantile plot and Manhattan plot of GWA results for association of SNPs with CMH in NELSON amplified with 
bloodbank controls and corrected for center (A and B). Quantile-quantile plot and Manhattan plot of GWA results for association of SNPs with 
CMH in NELSON, corrected for center and smoking habits (C and D). 
doi:1 0.1 371 /journal, pone.0091 621 .g002 



Functional relevance of SATB1 and rs6577641 

1) Transcriptional regulation of SATB1 mRNA expression 
We analyzed the association of SA TBI expression levels in lung 

tissue with rs6577641 genotype in 3 independent data sets of the 
Universities of Groningen, Laval and UBC [14]. A cis-acting effect 
of rs6577641 on SATB1 expression was identified and present in 
all three datasets (n= 1,095), with the same direction of effect 
across all three SATB1 probes on the array. The (susceptibility) G 
allele increased expression, the (protective) A allele reduced 
expression (p = 4.3xl0 -9 ) in the meta-analysis across the three 
datasets and across all three SATB1 probes measured (Table 7). 

2) SATB1 mRNA expression and CMH 

We compared SATB1 expression in baseline airway wall 
biopsies of COPD patients with (n = 38) and without (n = 39) 
CMH in GLUCOLD [15]. CMH was significantly associated with 
SATB1 expression levels (corrected for ex-/current smoking; 
p = 0.0045; Figure 5). After stratification, the same direction of 
effect was present in ex- and current smokers. However, this 
association reached statistical significance in current smokers 
(p = 0.021) and not in ex- smokers (p = 0.132), probably due to a 
difference in power as 46 subjects were current smokers versus 33 
ex-smokers. 



3) Genotype related protein expression and mucus positivity in 
bronchial epithelium 

SATB1 protein expression has previously been observed in IHS 
analysis of bronchial epithelial cells [16]. Therefore, we stained 
SATB 1 on paraffin embedded lung tissue biopsies of individuals 
from the Groningen population contributing to the eQTL 
analysis. We observed clear nuclear staining for SATB1 in 
bronchial epithelial cells. No significant difference for % of strong 
positive, positive and weak positive cells was observed between the 
protective (AA, n = 9) and risk (GG, n= 14) rs6577641 genotypes 
(11.8%±5.8 versus 12.7%±6.9, p = 0.74). 

We determined whether the fraction of mucus positive 
bronchial epithelium was different in subjects with different 
homozygous rs6577641 genotypes and performed PAS-staining 
on tissue biopsies from the same cohort. We observed no 
significant difference between individuals with the homozygous 
protective (AA, n= 10) and risk (GG, n = 7) alleles (19.7% ±11.9 
versus 14.3%±9.6, p = 0.34). 

4) SATB1 expression levels during bronchial epithelial cell 
mucociliary differentiation 

We investigated whether SATB 1 expression was induced during 
mucociliary differentiation of primary human bronchial epithelial 
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Table 4. SNPs associated with CMH with a p-value<10 4 , present in GWAS-I and in GWAS-II, in the NELSON identification cohort. 





Chromosome 


SNP 


Base pair position 


p-value GWAS 1 


p-value GWAS II 


2 


rs6735868 


103582093 


1.11 x10~ 05 


1.08xl0~ 05 


3 


rs 1387089 


1940922 


7.94 x10~ 05 


4.56 xlO~ 05 


3 


rsl 488757 


1981567 


2.17x10~ 05 


1.16x10~ 05 


3 


rs6577641 


18397849 


6.83x10~ 05 


2.57 x10~ 05 


4 


rs4306981 


79924121 


9.74 x10~ 05 


5.18x10~ 05 


8 


rs4242562 


1 1 5475287 


7.66 x10~ 05 


5.13x10~ 05 


8 


rs7836298 


1 1 5504434 


1.03 xl0~ 05 


4.37 xlO~ 06 


8 


rs7823554* 


115553109 


6.05 x10~ 05 


5.22 x10~ 05 


8 


rs7836963* 


115568426 


5.52 x10~ 05 


4.24 x10~ 05 


8 


rs16886291 


115711436 


3.54 x10~ 05 


2.09 x10~ 05 


8 


rsl 0098746 


125838127 


8.47 x10~ 05 


4.34 x10~ 05 


8 


rs7831595 


144974963 


3.08 x10~ 05 


2.32 x10~ 05 


9 


rs4842047 


138816796 


2.63 x10~ 05 


4.51 x10~ 05 


10 


rs943189 


22842590 


5.57 x10~ 05 


6.33 x10~ 05 


11 


rsl 1026531 


22379184 


2.76 x10~ 05 


8.55 x10~ 05 


12 


rsl 894307* 


12005720 


9.04 x10~ 06 


7.1 8x1 0 -06 


12 


rs2255953 


12010736 


1.13x10~ 05 


4.33 x10~ 06 


12 


rs2855708 


12013572 


6.47 x10~ 05 


3.97 x10~ 05 


12 


rsl 0879509* 


73242131 


6.98 xl0~ 06 


4.44 x10~ 05 


12 


rs4760851 


73284781 


4.85x10~ 06 


2.29 x10~ 05 


12 


rs952394 


73441110 


4.18x10~ 05 


4.22 x10~ 05 


12 


rsl 28221 99 


75458164 


4.82 xl0~ 05 


8.58 xlO~ 05 


12 


rsl 379963 


75493882 


1.18x10~ 05 


2.20 xlO~ 05 


12 


rsl 795669 


76273692 


8.01 x10~ 05 


7.86 x10~ 05 


13 


rs9578362 


21882381 


4.28 x10~ 05 


7.99 x10~ 05 


13 


rs1211304 


50381016 


9.96 x10~ 05 


1.12x10~ 05 


14 


rs992745 


27810095 


7.67 x10~ 05 


2.99 x10~ 05 


15 


rs754661 


26934277 


4.54 x10~ 05 


2.88 x10~ 05 


15 


rs4775569 


46850317 


4.20 x10~ 05 


1.72x10~ 05 


16 


rsl 3333521 


19904082 


5.08 x10~ 05 


2.50 x10~ 05 


17 


rsl 1652469 


49565797 


1.13x10~ 05 


3.80 x10~ 05 


18 


rs8086262 


69227590 


1.15x10~ 05 


2.53 x10~ 05 


20 


rs4815628 


3891896 


4.17x10~ 05 


2.15x10~ 05 


21 


rs2032257 


27774870 


3.97 x10~ 05 


5.39 x10~ 05 


22 


rsl 0091 47 


30088257 


8.41 x10~ 05 


4.51 xlO~ 05 


22 


rsl 005239 


47687170 


9.86 x10~ 05 


8.67 x10~ 05 



*SNP not selected for replication because of strong linkage disequilibrium with another SNP. 
doi:1 0.1 371/journal.pone.0091 621 .t004 



(HBE) cells in vitro and compared SATB 1 mRNA expression levels 
at different time points of an air-liquid interface (ALI) culture for 
up to 45 days. ALI culture of HBE cells induced mucociliary 
differentiation, as confirmed by induction of expression of FOXJ 1 , 
a marker for ciliated cells (19) and MUC5AC, a marker of goblet 
cells. SATB1 expression was induced over time (Figure 6), with an 
approximately 8-fold increased expression from the start to the 
end of the 45-day ALI culture period. 

Discussion 

Since not every ex- or current heavy smoker suffers from 
chronic mucus hypersecretion (CMH), we aimed to identify 



genetic variants conferring susceptibility to CMH. Therefore, we 
performed the first GWA study on CMH, the key presenting 
symptom in chronic bronchitis. CMH was associated with 36 
SNPs at the p< 10 significance level in the identification cohort. 
In the meta-analysis combining our identification and replication 
cohorts, strong association was observed with rs6577641, a SNP 
located on chromosome 3 in intron 9 of SATB1. Although the 
association of rs6577641 with CMH did not reach conventional 
genome-wide significance, its effect was in the same direction and 
was significant (4.25 xl0 -B ) at nominal levels (1.61 xlO -3 ) across 
eleven study populations, showing the robustness of this finding. 
The detected odds ratio for this SNP suggests an additional risk of 
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NELSON 
Rotterdam 
LifeLines 
Vlagtwedde Vlaardingen 
Doetinchem 
Poland 
Heidelberg 
GLUCOLD 
Rucphen 
ECLIPSE 
COPDGene 
Norway 

Fixed effect 
Random effect 



4> 



0.5 1 1.5 

Odds ratio 

Figure 3. Forest plot showing evidence of association for 
rs6577641 with chronic mucus hypersecretion in the identifi- 
cation and replication cohorts. Vertically left, the identification 
cohort and the replication cohorts included in the meta-analysis. The 
boxes represent the precision and the horizontal lines represent the 
confidence intervals. The squares represent the pooled effect estimate 
from the meta-analysis of all cohorts. The horizontal axis shows the 
scale of the effects. 

doi:1 0.1 371 /journal.pone.0091 621 .g003 

17% per G allele to develop CMH in a population of ex- and 
current heavy smokers. 

Multivariate regression analysis, stratified for current an ex- 
smoking, showed essentially the same effect sizes and direction of 
the association of CMH and the risk allele of rs6577641. It is likely 



that lack of power is the reason for not reaching the level of 
significance in ex-smokers. 

These data strongly suggest that SATB1 plays a role in the 
susceptibility to CMH in subjects with a history of heavy smoking 
(&20 pack-years) within the general population. Moreover, 
rs6577641 has a or-eQTL effect on SATB1 lung tissue expression, 
the risk allele at rs6577641 (G) increasing and the A-allele 
reducing expression oiSATBl significantly. Additionally, we found 
a higher SATB1 expression in bronchial biopsies of COPD- 
patients with CMH. We found no differences between the GG and 
AA genotypes for protein expression of SATB1 in airway 
epithelium by IHC in a small sample from our lung tissue registry. 
Finally, we demonstrate that SATB1 mRNA expression is induced 
during mucociliary differentiation in ALI cultures of human 
bronchial epithelial cells of 2 donors supporting our eQTL 
findings. Interestingly, expression of the mucin gene MUC5AC 'was 
also induced during this culture period, with a slightly delayed 
kinetics compared to SATB1. Together these data strongly suggest 
that SATB1 is induced during differentiation of bronchial epithelial 
cells and affects chronic mucus hypersecretion. 

The forest plot clearly shows that the effect of SNP rs6577641 is 
lower in cohorts including COPD patients only (GLUCOLD, 
Rucphen, COPDGene, ECLIPSE and Norway) than in the other 
cohorts. Additional meta-analysis of COPD-cohorts and general 
population based cohorts separately confirmed this (COPD 
cohorts, combined p-value = 0.236, OR = 1.07 and general 
population based cohorts, combined p-value = 5.18 xlO -7 , 
OR= 1.26). This suggests genetic heterogeneity of CMH in 
subjects with and without COPD. 

The SNP most significantly associated with CMH, rs6577641, is 
located in an intron of SATB1. SATB1 is a transcription factor and 
chromatin (re)organizer important for controlling the expression of 
many genes in a tissue or cell-type specific fashion, for instance in 
differentiating thymus T-cells [17] or differentiating skin kerati- 
nocytes [18]. Expression of SATB1 has been observed in normal 
human bronchial epithelial cells by immunohistochemistry and 
lower levels were observed in non-small lung cancer cells [16]. In 
our study, we also showed the presence of SATB1 in bronchial 
epithelial cells by IHC staining of lung tissue. However, no 
significant differences were found between patients homozygous 
for the protective and risk alleles, for either specific SATB1 
staining or for PAS staining, the latter specifically detecting mucus. 
This inability to detect a genotype effect on protein staining may 



100- 

x 

O 75' 



t/i 
t> 

CD 
XT 



total population current smokers ex smokers 



1 III III 



I I noCMH 
^■CMH 



AA AG GG 

891 1,222 399 



AA AG GG 

502 668 215 



AA AG GG 

389 554 184 



Figure 4. Percentage of subjects with chronic mucus hypersecretion (CMH) within genotypes (AA, AG and GG) of rs6577641 in the 
identification cohort (NELSON), and distributed among ex- and current smokers. 

doi:1 0.1 371 /journal.pone.0091 621 .g004 
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Table 7. Meta-analysis of the effect of rs6577641 on mRNA expression levels of SATB1 in the lung* 







Probe Gene 
Symbol 


Affymetrlx Probe ID 


Z-score Groningen 


Z-score Laval 


Z-score UBC 


Z-Score Meta- 
Analysis 


p-value Meta- 
Analysis 






N = 351 


N = 335 


N=409 






SATB1 


100148784_TGI_at 


-2.28 


-0.08 


-1.62 


-2.29 


0.022 


SATB1 


100150253_TGI_at 


-0.84 


-0.49 


-1.62 


-1.70 


0.088 


SATB1 


100305926_TGI_at 


-2.81 


-1.38 


-1.46 


-3.26 


0.001 





*To assess the effect of the SNP rs6577641 on gene expression, a Kruskal-Wallis test was performed. This test generates a p-value, but does not give a direction of the 
effect. To assess the direction of the effect, a Spearman's correlation test was performed. Next, a Z-score was calculated for each center and a meta-analysis performed 
for each of the three SATB1 probes across all centers. Finally, a meta-analysis for all three SATB1 probes was performed across all centers. This generated a Z-score of 
—5.87 and a corresponding p-value of 4.3*10~ 9 , indicating that the susceptibility G allele of the SNP rs6577641 increases SATB1 expression. 
doi:1 0.1 371/journal.pone.0091 621 .t007 



be due to lack of power, as we found a large variation in SATB 1 
and PAS protein expression in the relatively small number of lung 
tissue samples. Other explanations include possible expression 
regulation of SATB1 by smoke exposure which could be a 
dynamic process not readily detected at the protein level by any 
single-time point analysis such as IHC staining on lung biopsies. 
Alternatively SATB1 expression levels may vary throughout the 
lungs or the technique used here is not sensitive enough to detect 
relatively small differences in protein levels. To further explore the 
association of SATB1 protein and its underlying regulation, it 
would be of interest to perform longitudinal investigations on lung 
tissue samples of subjects with and without CMH, or time series of 
in vitro cultured epithelial cells from donors with a specific genotype 
and cigarette smoke exposure. This would also allow further 
studies on epigenetic regulation with methylation, microRNA or 
histone modifications. 

The lack of association between the SATB1 protein and 
rs6577641 might additionally be due to the location of mucus 
positive cells in lung tissue. Mucus is produced both by goblet cells 
and submucosal glands, which we did not investigate further. 
Normal mucus consists of 97% water and 3% solids including 30% 
mucins. In case of dysregulation of mucus production, the 



p = 0.0045 



UJ 



CO. 



0.15 



0.05 



5 -0.05 




CMH 



CMH 



Figure 5. Bronchial biopsy mRNA-expression levels of SATB1 in 
COPD patients with chronic mucus hypersecretion (n = 38) 
compared to patients without chronic mucus hypersecretion 
(n = 39). 

doi:1 0.1 371 /journal.pone.0091 621 .g005 



concentration of solids in mucus may increase up to 15%. A 
further step therefore could involve investigating mucins/ proteins 
present in mucus, e.g. MUC5AC is predominantly produced by 
goblet cells in proximal airways and MUC5B by secretory cells 
throughout the airways and by submucosal glands. 

How does SATB1 expression contribute to CMH? SATB1 is 
known to be a genome organizer, a tissue specific chromatin 
remodeling protein with a property to modifying chromatin 
architecture by formation of loops, allowing contact of condensed 
genomic DNA to regulatory transcription proteins [19]. Thus 
SATB1 can control gene expression of a series of target genes 
located within a single locus at a specific chromosomal location 
[20]. This has for instance been elegantly shown in case of 
differentiating keratinocytes [18], where Satbl expression regu- 
lates genes located in the keratinocyte-specific loci, leading to 
adaptation of a specific cell fate of the differentiating keratinocytes. 
Similarly, a mechanism by which SATB1 could contribute to 
CMH is the induction of a gene expression program during 
differentiation of bronchial epithelial cells, leading to adaptation of 
a cell fate specific for mucus producing cells in the submucosal 
glands or a goblet cell phenotype in the bronchial epithelium. 
Involvement of Satbl in pneumocyte differentiation was previ- 
ously observed by Baguma et al. in mice [21]. We observed 
induction of SATB1 expression in bronchial epithelial cells 
differentiating under ALI culture conditions. Further research will 
need to test whether a specific gene expression profile is induced 
by SATB1 expression in differentiating bronchial epithelial cells. 
SATB1 is also highly expressed in thymocytes, but absent in 
mature non-activated T cells [22]. Moreover, Satbl has been 
shown in mice to be essential for expression of T helper 2 (Th2) cells 
important in the regulation of genes encoding interleukin 4, 5 and 
13 [19]. In Satbl -deficient mice, development of thymocytes 
stopped after the CD4V CD8 + stage with deregulation of many 
genes [23]. Conversely, in case of excessive SATB 1 -production an 
excess of Th2 cells may be formed which all produce IL-13, which 
may contribute to increased mucus production. Therefore, a 
putative role of SATB 1 in T-cells for the CMH phenotype should 
not be disregarded. 

Strength of our study is the fact that we were able to replicate 
our findings in different populations, ranging from cohorts 
consisting of individuals with severe airflow limitation to cohorts 
mainly consisting of healthy smokers. There are some limitations, 
e.g. the presence of CMH was not based on actual measurements 
of the amount of sputum produced but based on questionnaires 
that were not completely similar in all study cohorts. Underre- 
porting of CMH occurs since those experiencing CMH become 
accustomed to these symptoms, believing they are smoking related 
or because they are embarrassed to admit to cough and sputum. 
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SATB1 expression in ALI 




Day in ALI culture 

Figure 6. SATB1, MUC5AC and FOXJ1 mRNA expression levels during mucociliary human airway epithelial cell differentiation (n = 2 
donors). Expression of SATB1, the identified gene in our study, MUC5AC a marker of mucus, and F0XJ1, representing ciliated cells in epithelial cell 
culture on air liquid interface. 
doi:1 0.1 371 /journal.pone.0091 621 .g006 



We demonstrated that SATB1 mRNA expression is induced 
during mucociliary differentiation in ALI cultures of HBE cells in a 
small dataset (n = 2). However, these data seem reliable as they are 
supported by eQTL data from lung tissue. Despite this drawback, 
we consistendy found evidence for association of SATB1 with 
CMH in the populations studied, showing the robustness of our 
finding. Moreover, we corroborated this finding by functional 
studies in lung tissue, airway wall biopsies of COPD patients and 
epithelial cultures. More extensive research is needed to investigate 
which factors induce SATB1 expression in airway epithelium. 

In summary, we performed identification analyses and meta- 
analyses using data from almost 7,000 participants to identify 
genes involved in susceptibility for CMH. It is remarkable that we 
found a genetic association for CMH given this phenotype is pardy 
subjectively determined and not well delineated. Moreover, 
despite cohort differences to define CMH and severity of airflow 
limitation, we found consistent effects of SNP rs6577641 on CMH. 
This confirms that the CMH phenotype, despite the fact that it is 
self-reported, is a robust phenotype irrespective of the presence or 
absence of airflow limitation. The association of rs6577641 on 
chromosome 3 at the SATB1 locus with CMH was supported by 
functional studies including gene expression findings, demonstrat- 
ing SATB1 to be associated with CMH. 

Chronic mucus hypersecretion is a bothersome symptom for 
many people, it increases in prevalence with aging and affects 
quality of life, exacerbations of symptoms due to respiratory 
infections and ultimately increases mortality. The involvement of 
SATB1 in CMH offers opportunities to better understand the 
process leading to CMH, and future development of tailored 
medicines. 
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